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Abstract 

This paper presents new low-complexity lattice-decoding algorithms for noncoherent block detection of QAM 
and PAM signals over complex-valued fading channels. The algorithms are optimal in terms of the generalized 
likelihood ratio test (GLRT). The computational complexity is polynomial in the block length; making GLRT-optimal 
noncoherent detection feasible for implementation. We also provide even lower complexity suboptimal algorithms. 
Simulations show that the suboptimal algorithms have performance indistinguishable from the optimal algorithms. 
Finally, we consider block based transmission, and propose to use noncoherent detection as an alternative to pilot 
assisted transmission (PAT). The new technique is shown to outperform PAT. 

Index Terms 

Noncoherent detection, lattice decoding, wireless communications. 

I. Introduction 

Noncoherent transmission of digital signals over unknown fading channels has recently received significant 
attention especially for the case of the block-fading channel model. Applications include recovery from deep 
fades in pilot-symbol assisted modulation based schemes, eavesdropping, and non-data-aided channel estimation. 
Noncoherent transmission is particularly applicable to systems exhibiting small coherence intervals where the use of 
training signals would result in a significant loss in throughput. Recently, some elegant information-theoretic results 
have been derived for noncoherent single and multiple-antenna systems under the assumption of Rayleigh fading, for 
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example [1,2]. Information theoretic aspects of noncoherent transmission were considered in [3] which concluded 
that at low SNR and for small coherence intervals there is a significant capacity penalty by using training. Under 
this noncoherent detection regime, it has been shown by numerical simulation that standard modulation techniques 
such as quadrature amplitude modulation (QAM) can achieve near-capacity in the single-antenna noncoherent block 
Rayleigh-fading channel [4]. 

This paper focuses on noncoherent receiver design for the block fading channel. A great deal of work has been 
performed on partially coherent receivers such as pilot-symbol assisted modulation (PSAM) [5,6], per-survivor 
techniques [7], and coupled estimators [8]. However the challenge remains to develop high-performance, low- 
complexity, fully noncoherent receivers. 

Various suboptimal algorithms have been proposed for block-based noncoherent detection. For slowly fading 
channels, a blind phase recovery approach was proposed in [9] for noncoherent detection of differentially encoded 
QAM [10] where the attenuation was assumed to be known exactly at the receiver. In [11], a suboptimal technique 
for PSK was proposed which involved forming a number of equally spaced channel phase estimates. An extension 
to multi-amplitude constellations was also presented, where every sequence of symbol amplitudes is considered, 
and then the PSK technique is applied to determine the phase of the symbols. Unfortunately, the complexity of this 
suboptimal approach is still exponential in the sequence length, albeit with a smaller base. 

Recently, lattice decoding algorithms have been applied to noncoherent and differential detection. For PSK 
over temporally-correlated Rayleigh fading channels, a form of lattice decoding (namely sphere-decoding) can be 
applied since it turns out that the detection metric is Euclidean [12]. Lattice decoding techniques have also been 
used for differential detection of diagonal space-time block codes over Rayleigh fading channels, by approximating 
the decision metric with a Euclidean metric [13, 14]. In [15], we presented simulation results for another lattice 
projection approach for suboptimal PAM and QAM detection. Unfortunately, each of these algorithms require 
complexity exponential with the block length to guarantee that the optimal estimate is found [16,17]. Practical 
implementation considerations demand that low complexity algorithms be developed. 

For the case of the constant envelope PSK constellation, a detection algorithm with complexity 0(T log T) was 
developed in [18, 19] (where T is the block length), which can provide the optimal data estimate over an unknown 
noncoherent fading channel, in terms of the Generalized Likelihood Ratio Test (GLRT). The GLRT is equivalent to 
joint ML estimation of a continuous valued channel parameter and discrete-valued data parameters. This approach 
was generalized in [20], where they outlined a general graph-based approach which involved forming a spanning 
tree. Specific details were presented for the cases of QAM over a phase noncoherent channel {i.e. known channel 
amplitude) [20,21], and for PSK over a fading channel with coding [22]. The challenge remains to develop efficient 



IEEE TRANSACTIONS ON SIGNAL PROCESSING, ACCEPTED TO APPEAR (ACCEPTED NOV. 2006) 3 

algorithms for optimal noncoherent sequence detection of multi-amplitude constellations over fading channels. 

In this paper we propose a new GLRT-optimal noncoherent lattice decoding approach for QAM and PAM symbols 
which has complexity polynomial in the block length. We start by considering detection of M-ary PAM over real- 
valued fading channels (which we shall term real-PAM). We show that the GLRT-optimal codeword estimate is the 
closest codeword (or lattice point) in angle to the line described by the received vector. We propose an algorithm 
that searches along the line, and chooses the best codeword estimate from this search. We provide a theorem that 
bounds the search to a segment of the line, limiting the number of codewords that need to be considered. We show 
how the search can be done in an iterative manner, and that the resulting complexity of the algorithm is 0(T log T). 

We then consider the more practical case of M-ary QAM detection over complex-valued fading channels, and 
show that in this case the GLRT-optimal codeword estimate is the closest codeword in angle to a plane described 
by the received vector. We propose an algorithm that searches across the plane, and chooses the best codeword 
estimate from this search. We provide a theorem that bounds the search to a segment of the plane. We show how 
the extent of the search can be further reduced by exploiting the rotational symmetry of the constellation. The 
resulting plane search algorithm can be performed with complexity of order 0(T 3 ). 

We also present new suboptimal noncoherent QAM detection algorithms with even lower complexity; by com- 
bining a channel phase estimator with our fast real-PAM algorithm. We propose using 0(T) instances of the 
real-PAM algorithm. This approach therefore has complexity of order 0(T 2 logT). Simulations indicate that there 
is a negligible performance loss compared to GLRT, when using this suboptimal technique. 

Finally, we also propose a pilot-assisted version of our new reduced-search noncoherent lattice-decoding algo- 
rithms. The pilot symbol is used to remove the ambiguities inherent with noncoherent detection. Our approach 
obtains improved performance compared with standard pilot assisted transmission [6], while maintaining the same 
data rate. 

II. System Model 

A. Signal Model 

We define a codebook C(X, T) as the set of all possible sequences of T transmitted symbols, x = [x\, . . . , xt]', 
such that each x t is in some constellation X. For an M-ary PAM constellation, X = {±1, ±3, . . . , ±(M — 1)}. For 
QAM, X is a subset of the Gaussian (complex) integers with odd real and imaginary components. For example, for an 
M 2 -ary square QAM constellation, X = { x | Re {x} e X' , Im {x} e X'} where X' = {±1, ±3, . . . , ±(M - 1)}. 
Thus each codebook C(X, T) is a set of lattice points drawn from a subset of the unit lattice of K T or C T . 

We consider block fading channels and assume that the channel h is constant for at least T symbols as in [1-4, 
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11]. We will consider narrowband fading channels where h is either real-valued and complex-valued channels. Thus 
we can write the received codeword y = [ y-y, . . , , y-r ]' as follows, 

y = hx + n (1) 

where n = [ n\, . . . , ut ]' is a vector of additive white Gaussian noise. 

B. Detection 

The noncoherent detection problem is to estimate x based on y without knowledge of the channel and in the 
absence of training data. The log-likelihood function of the maximum likelihood (ML) detector (of both channel 
and data) is given by 

i(y;x,/ l ) = -||y-/ l x|| 2 (2) 

where constant factors have been discarded and || || represents the Euclidean norm. For a given codeword hypothesis 
x, the likelihood function is maximized by choosing 

x^y 

h = (3) 
l|x|| 2 

where (-)t denotes Hermitian transpose. 

Hence, the ML estimate of x conditioned on the corresponding channel estimate, is given by 

-opt r ( • *V 1 

x F = arg max L I y; x, 



xee(*,T) \- ■ ||x|| 2 



|x f y 



2 



arg max - — — (4) 



*eC(*,T) ||x|[ 

This is the Generalized Likelihood Ratio Test (GLRT) [23] considered in [11,20]. 
Note that (J4j is equivalently given by 



x T y 

x op = arg max „ — ■ — ^ (5) 



■*ec(*,r) ||x|| 2 ||y|| 2 
= arg max cos 2 #(x,y) (6) 

iteC(X.T) 

where (9(x, y) is the principal angle between x and y [24]. Thus x opt can be found by searching the points of 
C(X,T) to find the one closest in angle to y. 

For QAM, we can also obtain a geometric interpretation of by expressing the complex vectors in M. 2T . We 
will use the underscore notation x to denote the mapped version of x as follows, 

x = [ Re {zi} Im {m} ... Re {x T } Im {x T }]' (7) 
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and denote the real-valued codebook as Cr(X,T) = {x | x £ C}. For M 2 -ary square QAM, we therefore have 
C R {X,T) = C(X',2T) where X' is an M-ary PAM constellation. We also define Y G M 2Tx2 as a basis for the 
subspace yC mapped into the real space R 2T ; that is 

Re {yi} Im{yi} ... Re{y T } lm{y T } 
Im{yi} Rc{yi} ... -Im{y T } Re{y T } 
Note that the columns of Y are orthogonal. The projection matrix P(y) G R 2Tx2T is defined as 

YY' 

P(y) 4 (9) 

llyll 2 



Y = 



(8) 



such that 



P(y)x = arg min ||x - v|| . 



That is, the vector P(y)x is the projection of x onto the subspace YK 2 . Now, it can be easily shown that 

x opt = arg max cos 2 0(x, P(y)x) 

x:x£C(A\T) 

Thus the GLRT-optimal data estimate x opt , corresponds to the x G Cr(X, T) closest in angle to the plane Yl 2 . 

It is important to note that two forms of ambiguity exist for this noncoherent detection problem. The first is 
the well-known phase ambiguity which occurs for any constellation that is invariant to a particular phase rotation. 
For example, for square QAM constellations the following four optimal channel estimate and codeword pairs have 
the same likelihood: (h ? 1 , x ?'), (-/i opt , -x opt ), {-ih ? 1 , ix opt ) and (ih ^, -ix ? 1 ); corresponding to the four tt/2 
rotations of the constellation. We will assume that this type of ambiguity can be resolved, for example, by using 
the phase of the last symbol from the previous codeword [4], or by using differential encoding [10]. The second 
type of ambiguity we call a divisor ambiguity and arises when there are multiple points in C(X,T) that lie on 
the same 1-dimensional (real or complex) subspace e.g. [1, 1, 1] and [3, 3, 3] for 4-ary real-PAM with T = 3. This 
produces a lower bound on the noncoherent block detection error rate as discussed and analyzed in [25]. 

III. Reduced Search Space 

In this section we show that the GLRT-optimal data estimate x opt , can be found without testing all the elements of 
C(X,T). In the previous section we established that x opt is the codeword closest in angle to a particular subspace, 
so it naturally makes sense to define a 'nearest neighbor set' of the subspace and search within that set. The 
subspace of interest has basis vector y and passes through the origin. We show that the nearest neighbor set for this 
subspace contains x opt . This implies that low complexity decoding algorithms can be developed, based on finding 
this particular nearest neighbor set, and searching it. 
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Definition 1: We define NN(v) to be the point, or set of points, in C(X, T) closest to the arbitrary point v {i.e., 
the nearest neighbor to v). That is, d is an element of NN(v) if ||v — d|| ^ ||v — z|| for all z e C(X,T). 
Of course, usually NN(v) will have a single element, and in this case we can write NN(v) = d. 

Definition 2: We define Af(C(X, T),y) to be the nearest neighbor subset of the codebook C(X, T), corresponding 
to the subspace with basis vector y, passing through the origin. That is, u e Af(C(X,T),y) if and only if there 
exists some A such that NN(Xy) = u. 

Note that from a geometrical perspective, it is useful to think of A as being equivalent to the inverse of a channel 
estimate; implying that a point u is in the nearest neighbor set if there is a channel estimate h such that the distance 
|y — hu\ is smaller than for any other point. Consequently we define A opt = (/i opt )~ 1 as the reciprocal of the optimal 
channel estimate. 

The following property of the GLRT-optimal codeword estimate x opt , allows us to reduce the set of codewords 
which need to be tested, to a small subset of the \X\ possible codewords (where |-| denotes set cardinality). Note 
that an equivalent result was presented in [20], however the geometrical interpretation of our formulation is more 
apparent; and is important when developing our new search algorithms later. 

Property 1: 

±°VeN(C{X,T),y). 

Proof: Consider the case where x opt ^ Af(C(X,T),y). From Definition [2] this implies that there exists some 
x e Af(C(X,T),y) such that ||A opt y-x|| < ||A opt y-x opt || however this would imply L(y; x, /i opt ) > i(y;x opt , h opt ) 
from d2J and hence we have a proof by contradiction. ■ 

IV. PAM Detection For Real- Valued Fading Channels 

This section presents a low complexity algorithm for GLRT-optimal noncoherent PAM detection over real-valued 
channels. Practically, such channels arise in baseband transmission (eg. multi-level PCM), or in certain bandpass 
systems where phase and frequency are separately estimated by a phase-locked loop. 

We first present a theorem that we will use to reduce the number of codewords that need to be examined, even 
beyond the limitations imposed by Property Q] Note that in this real-valued channel case, the subspace of interest 
(defined by y) actually reduces to a line, yR. The theorem implies that only a limited extent of the line needs to 
be searched; and that the extent depends on the largest value of y. We then propose a fast low-complexity iterative 
algorithm to perform the search. In the sequel, we will extend the algorithm to complex-valued channels. Later, we 
will directly incorporate the algorithm from this section into an extremely low complexity suboptimal algorithm for 
noncoherent detection over the more commonly encountered complex-valued channels. 
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A. Limiting the Search Space 

Theorem 1: For noncoherent detection of M-ary PAM codewords of length T over a real- valued fading channel 

|A opt y t | ^M + T-2 

for all t = 1,2,..., T. 

Proof: Define n = argmax t {|A opt y t |}. Note that if |A opt y„| M then the theorem is satisfied. 
Now, consider the alternative case when |A opt y„| > M. Rearranging the GLRT-optimal channel estimate in <[3j 
gives (x opt )'(y - h opt X opt ) = and hence 

(x opt )'(A opt y-x opt ) =0. (10) 

We will use this property to bound A opt . Using (0 and the fact that C(X,T) contains all possible sequences 
{ x | xt £ X V< }, the elements of x opt can be determined on an element-wise basis as 

x° p = arg min I A opt y t — x I . (11) 

For the case we are considering where |A opt j/„| > M, it follows that since the largest PAM constellation values are 
±(Af — 1), that x° pt — sgn{y n }(M — 1) where sgn is the signum function. 
We now substitute (fTTb into ( fTOb to bound A opt , which gives 

£T (\° pt yn - &t) = E zTi^yt - d2) 

Now ( fTTT i and the symmetry of the PAM constellation implies that sgn{i° pt } = sgn {\ opt y t }. Moreover, since 
i° pt € {±1, ±3, . . . , ±(M - 1)} it follows form the definition of A opt that -1 < X opt y t -x° pt < 1 for all x° pt except 
±(M - 1). More generally, for all i° pt , sgn {x° pt } (\ opt y t - x° pt ) ^ -1 and hence i° pt (A opt y t - x° pt ) ^ -\x° pt \. 
Substituting this into ( fT2l gives 

xT(X opl y n -xT)^Y,K l \ 

and hence 



|A op, y„ 



—y\ 

-opt 

Xn t^n 



^ M - i+(T "M (M r 1)=M+T - 2 - 



Therefore, since |A opt i/ t | ^ |A opt y n | for all t the theorem is proved. 
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B. PAM Algorithm 

In this Section we use Property Q] and Theorem[T]to develop a low-complexity algorithm for real-PAM detection. 
This algorithm reduces the number of codewords for which the decision metric is evaluated to order MT, which 
is much smaller than the set of all possible M T codewords that would be considered by an exhaustive search. 
Furthermore, we demonstrate how the algorithm can be implemented in an iterative manner so that the complexity 
is 0(T log T). 

Property Q] implies that x opt can be found by calculating the metric in for only those x € J\f(C(X,T),y), 
i.e. for only those x G C for which the line yR passes through its Euclidean nearest neighbor region. Furthermore, 
Theorem Q] implies that only a finite segment of the line need be considered. We have demonstrated such a search in 
Figure Q] which shows the positive axes for 8-ary PAM with T = 2, where the shaded regions indicate the nearest 
neighbor regions of the points which need to be searched. The specifics of the algorithm are as follows. 

First, for ease of notation we modify the received codeword y by changing the signs of all negative elements in 
y. This will mean that the corresponding (modified) x opt will now have all positive elements. The true (original) 
GLRT estimate of x can be obtained by applying the reverse sign changes to x opt . Observe that we can do this 
without loss of generality since the PAM constellation is symmetric around zero. 

Definition 3: We define P(x) to be the range of A such that x is the nearest neighbor to Ay, within the limits 
< A < A max = (M + T - 2)/ max t {y t } (where the limits are due to Theorem Q] and the fact that all x t are 
greater than zero for the modified received codeword). Formally, 

P(x) 4 {A | x £ NN{Xy), A e (0, A max )}. 

Note that each non-empty -P(x) corresponds to a distinct interval of the line yR. The proposed algorithm proceeds 
by enumerating these non-empty P(x)'s, by first enumerating their boundary points along the line yR. We then sort 
the boundary points so that the decision metrics for the corresponding x can be calculated in an iterative manner. 

For real-PAM, the boundary values of A can be shown to be given by Vt,b = for all t — 1, . . . ,T and 
b = 2, 4, . . . , M — 2 such that < v t ^ < A max (where the values of b come from the regular boundaries in the 
positive half of the PAM constellation X). We use Vo to denote the set of all (vt,bi t) pairs. 

We then sort the elements of Vo in ascending order of their Vfb value, and append the value (A max ,0) to the 
end of the ordered set (since this is the outer boundary of the segment of the line yR which needs to be searched, 
according to Theorem [T] where the second element of the pair is arbitrarily set to since it is not needed in the 
algorithm). We denote the newly ordered set by V, and index it by k, (i.e. we denote its kth element by (t/j. , tk)). 
These ordered values are shown on the example case of Figure [T] where the values of denote the distance along 
the line yR where the line crosses from one nearest neighbor region into the next. 
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We now show how V can be used to enumerate the codewords which need to be searched, and show how to 
calculate the corresponding decision metrics in an iterative manner. The algorithm can be visualized geometrically 
as searching along a segment of the line yM, by iterating in k. Whenever the line crosses from the nearest neighbor 
region of one lattice point (codeword) to the nearest neighbor region of another, we calculate the metric for the 
new lattice point. Note that in this context, the value of tk indicates the dimension of the boundary that is going 
to be crossed (in the T-dimensional space) when leaving the fcth segment of the line. 

The iterative search starts with the codeword xW = 1 = [ 1,1,..., 1]'; which has a corresponding decision 
metric L(l) = (l'y) 2 / ||1|| , where L(x) = (x'y) 2 / || x|| is the likelihood function in ©. We will use the symbol 
A as a marker for the most likely codeword, and we initially set it to A = z/i/2 (i.e. during the iteration process, A 
will be updated whenever a codeword is found to have a higher likelihood than any previously searched codeword, 
and the value of A will be chosen such that NN(X) gives the new codeword). 

The iteration proceeds by noting that each time a nearest neighbor boundary is crossed, only one element of the 
T-dimensional nearest neighbor codeword vector changes (since for real-PAM, the boundaries are straight lines, 
orthogonal to one of the dimensions, and parallel to all the others). Therefore the fcth codeword which needs to be 
considered, is calculated from the (k — l)th codeword, on an element-wise basis as follows: 

= < 4 " 1} ' ' 0TP ^ k - 1 (13) 



ip fc) + 1, forp = tk-i- 



We define ak = (x( fc ))'y and (3k — ||x( fc ) || 2 , and hence L(x^) = af./(3k is the decision metric for the fcth 
codeword considered. The values ak and (3k are calculated iteratively as follows, 

a k = a k -i + 2t/ tfe _ 1 (14) 

0k = Pk-l + &t h -y + 4- (15) 

If L(x( fe ') improves on the previous best codeword estimate then we update A in the interior of P(x' fe '), by 
setting \~{v k + iV-i)/2. 

Once all segments of the line have been searched, we have x opt = NN(Xy). Pseudo-code for the algorithm is 
given in Table ITTT1 

The complexity of the algorithm is a function of the number of intersection points v t ,b, '•<?., Ni = |Vo|. where 
|-| denotes set cardinality. Nj is upper bounded by (M/2 — 1)T, however in general it will be much less than this 
due to the restricted line search implied by Theorem Q] as shown by simulation in Section I VIII The sorting of Vo 
can be performed using standard sorting techniques in 0(Nj log Nj) [26]. The updates ( fT3l , ([Pil l and (fl5T l have 
complexity O(l), and the final calculation of x opt is of order T. Thus the overall complexity is dominated by the 
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sorting operation, and hence the complexity of the algorithm is of order 0(T log T); a significant improvement 
compared with an exhaustive search over all M T possible codewords in the codebook C(X,T). 



V. GLRT-Optimal QAM Detection For Complex-Valued Fading Channels 

This section presents a low complexity algorithm for GLRT-optimal noncoherent QAM detection over complex- 
valued fading channels. Similarly to the the real-PAM case, we first present a theorem that we will use to reduce 
the number of codewords that need to be examined, beyond the limitations imposed by Property Q] In the complex- 
valued channel case, the subspace of interest is the plane YR 2 , where Y was defined in ([8]). The theorem implies 
that only a limited extent of the plane needs to be searched; and that the extent depends on the largest element 
in y. We then propose a fast low-complexity algorithm to perform the search for QAM. We also show how PAM 
detection over complex-valued channels can be viewed as a special case of the QAM algorithm. 

A. Limiting the Search Space 

Theorem 2: For noncoherent detection of Af 2 -ary QAM codewords of length T over a complex-valued fading 
channel 

| Re {\ opl yt} | < M + 2T - 2, and 
|lm{A opt y t }| <M + 2T-2, 

for all £=1,2,...,T. 

Proof: Define the point v = A pt y, along with its corresponding real-valued representation v, as in (0. Also 
define n = argmax t {\v t \}. Note that if \v n \ < M then |Re {A opt y*}| ^ M and |Im {X opt y t }\ < M for all t and 
the theorem is satisfied. 

Now, consider the alternative case when \v n \ > M. Similarly to the real-PAM case, rearranging the GLRT-optimal 
channel estimate in (O gives (x opt )^(y — /i opt x opt ) = and hence 

(£apt)t( A °P t y-X°P t ) =0. (16) 



It follows that 



Hx^'ll 2 , 

A P , = H X II yfj^Opt 

|ytx°Pt| 2 



Combining this with the the fact that for any vector u G C T , the real-valued representation of the complex scalar 
y^u is Y'u, we obtain the real- valued representation of A pt as 



A opt - ll ~ l|2 9 Y , x 
llY'xIl 2 
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and therefore 

v = YA opt = -M_YY'x. 
I —'—II 

It follows that v'x = ||x|| 2 , '•<?. 

(x opt )'(v-x opt ) = 0. (17) 

Using (01 and the fact that C(X,T) contains all possible sequences { x | xt £ X Vt }, the elements of x opt can be 
determined on an element-wise basis as 

i° pt = arg mm \v t — x\ (18) 

for all t = 1, . . . ,2T where we recall from Section UTaI that X' = {±1, ±3, . . . , ±(M - 1)}. We now substitute 
( fT8l into ( fTTb which gives 

This is similar to (fT~2b in the proof of Theorem 1 . By following through the subsequent steps in the proof of Theorem 
1, and keeping in mind that the dimensions of the vectors are now of dimension 2T, we obtain ^ M + 2T— 2 
which implies that |Re {A opt y t }| < M + 2T - 2 and |Im {A opt y t }| < M + 2T - 2 for all t = 1, . . . , T. ■ 

B. QAM Algorithm 

In this Section we use Property Q] and Theorem |2] to develop a low-complexity algorithm for QAM detection. 
Property Q] implies that x opt can be found by calculating the metric in for only those x 6 J\f(C(X, T), Y), i.e. for 
only those x G C for which the plane YM 2 passes through its Euclidean nearest neighbor region. Furthermore, 
Theorem [2] implies that only a finite region of the plane need be considered. Conceptually, this is a direct extension 
of the real-PAM case shown in Figure Q] (considered previously). The difference being that Figure Q] shows the 
line yM, but we now have a plane YR 2 . Also the number of orthogonal dimensions doubles when considering 
complex-valued channels. We demonstrated this complex-valued channel QAM case in Figure |2] which is a two 
dimensional plot in the plane YR 2 . The parallel lines (at various angles) are the boundaries arising from the QAM 
constellation, and the shaded region indicates the nearest neighbor regions of codewords which need to be searched. 
The QAM search algorithm we present here, follows the same principles as the real-PAM algorithm of Section 
IIV-BI where instead of working with boundary points of line segments, we need to work with boundary edges of 
planar regions. The specifics of the algorithm are as follows. 

First, for ease of notation we modify the received codeword y by multiplying it by the complex scalar ym/\ym\, 
where m — argmaxt \y t \. This will mean that the mth element of y will be real- valued and positive. The true 
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(original) GLRT-optimal estimate of the channel can be be obtained by applying the reverse phase rotation to h° pt , 
while the optimality of the new GLRT-optimal codeword estimate is unaffected. 

Hence Theorem [2] implies that the search over the plane Y A is reduced to the segment of the plane for which 
| Ai|, |A 2 | < A max where A max = (M + 2T -2)/\y m \. Furthermore, as discussed in Section llTBl because of the tt/2 
phase ambiguity in square QAM constellations, there are four GLRT-optimal inverse channel estimates ±A opt , ±iA opt 
(with corresponding phase ambiguous GLRT-optimal codeword estimates). Hence, we only need to consider the 
square region of the plane 

5 = { A | Ai e (0, A max ), A 2 e [0, A max ) } (20) 

since exactly one of ±A opt , ±zA opt will exist in this region of the plane. Note that S is the shaded region in Figure 
|2] (mentioned previously). 

Similarly to the real-PAM case we make the following definition. 

Definition 4: We define P(x) to be the range of A e S such that x is the nearest neighbor to YA. Formally, 

-P(x) = {A | x e NN(Y\), A e S }. 

Note that each non-empty P(x) corresponds to a distinct region of the plane YR 2 . The proposed algorithm 
proceeds by enumerating these non-empty P(x)'s, by first enumerating their boundary vertices in the plane. These 
vertices are found by calculating the intersection of all the constellation-point boundary lines in the plane (e.g. as 
shown in Figure |2). The vertices are then used to calculate an interior-point inside each of the nearest neighbor 
regions in the shaded square S. The respective nearest neighbor codeword is calculated for each interior-point, and 
then it is only these points for which the likelihood metrics are calculated. Clearly, this is a significantly reduced 
search space compared with the space of all possible codewords. 

For QAM the vertices of the nearest-neighbor regions in the plane YM 2 can be found by first noting that, since 
x° pt can be given in on an element-wise basis as in ( fTSb . P(x) can be written as 

27' 

P(x) = f]{ AU 4 = aig min |(YA) t - x\, A e S} 

t=l 

\\ lu' iv (YA)t is the tth element of YA and we recall that X' = {±1, ±3, . . . , ±{M — 1)}. This can be written as 
the feasible region for the set of linear inequalities corresponding to the nearest neighbor region boundaries in X' 
for each element of x t , as 

2T 

P(x) = p| { A | l(x t ) < (YA), < u(x t ), A G S} 
t=i 

where l(x t ) and u(x t ) are the upper and lower nearest neighbor boundaries in the constellation X' . For t ^ 
{2m, 2m — 1} they take on values in the set {0, ±2, . . . , ±(M — 2), ±oo}. For t £ {2m, 2m — 1} we must 
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consider intersections with the boundary of S, and therefore in this case l(x_ t ) and u(x t ) take on values in the set 
{0, ±2, . . . , ±(M - 2), ±(M + 2T- 2)}. 

By including the square boundary of the region S, all non-empty P(x) are closed simply connected sets on the 
plane R 2 . Therefore, since P(x) is formed from linear inequalities it is a convex polygon in R 2 . For each P(x), 
denote P(x) as its polygonal boundary and V(x) as the vertices of the polygon. 

We now propose a method that enumerates all the vertices V^(x) for all non-empty P(x), and then uses these 
vertices to generate a point in the interior of all P(x), which is then used to obtain a unique codeword via finding 
the nearest neighbor codeword to that point. Consider the set of points { u ± efj, \v g V^x)}. If fi is some vector 
that is not parallel to any side of the polygon P(x), and if e is chosen sufficiently small, then at least one point in 
this set will be in the interior of P(x). Since the received symbol is subject to AWGN, and is therefore irrational 
with probability one, it follows that the arbitrary choice of \x = [ 1 1 ]' will almost surely guarantee this, given that 
e > is chosen sufficiently small. In practice, simply setting e to some small positive constant will be sufficient 
to ensure that a point in the interior of P(x) is enumerated. However, in Appendix [A] we present a technique to 
perform this in a strictly optimal fashion with complexity per vertex of 0(T). 

Since the vertices are shared by adjacent P(x), each vertex is only required to be enumerated once. We define 
the set of all vertices within or on the boundary of 5 as V = { u \ v g V^(x), P(x) ^ }. The set V can be 
enumerated as the the intersections of the lines Y_ t x v\ + Y_ t 2 V2 = b and Y_ t , x v\ + Y_ t , 2 V2 = b', for all pairs of 
t, t' and for all nearest neighbor boundaries b, b' in X'. That is 







Yt.i 


Yt.i 


-i 


b 


V2 




Yt,2 


y t , 2 _ 




V 



(21) 



for all * = 1, 2, . . . , 2T - 1, t' = t + 1, t + 2, . . . , 2T, and b, b' g B{t), where B(t) = {0, ±2, . . . , ±(M - 2)} 
if t £ {2m, 2m — 1} and for symbol indices t g {2m, 2m — 1} where we consider the square boundary B(t) = 
{0, ±2, . . . , ±(M - 2), ±(M + 2T - 2)}. 

To enumerate a point in each P(x), for each vertex v enumerated we calculate the points on the plane A + = 
v + efi, and A~ = u+e/j,. Then for each of these two points, if it is in the square S, we calculate the corresponding 
codewords NN(\ + Y) and/or NN(\~Y_) and the decision metrics in (0]). 

Pseudo-code is provided in Table |IV] 

The complexity of the algorithm is a function of the number of codewords examined, Nc, which is in turn 
a function of the number of vertices calculated. The number of vertices calculated in (|2T1 > corresponding to the 
intersections between lines in where b, b' is a boundary of X 1 and both b and b' are non-zero is T(2T — 1)[(M — 
l) 2 — 1]; for which at most two codewords are generated for a quarter of these intersections. For the intersections 
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of the boundaries of X' and the square S there are 2(2T — 2)(M — l) 2 intersections, which for one quarter of 
these intersections one codeword is generated. For the vertices at (0, 0) and (A max , A max ) one codeword is generated. 
Hence the total number of codewords examined is at most 

N c < T ~ [(M - l) 2 - 1] + (2T - 2)(M - l) 2 + 2. (22) 

Since the complexity of each codeword and decision metric calculation is of order T then the overall complexity is 
of order M 2 T 3 (which is linear in the constellation size M 2 ) a significant improvement over an exhaustive search 
over all M 2T possible codewords in the codebook C(X,T). 

A further reduction in computational expense, without any loss in optimality, can be achieved by enumerating 
only one out of each set of four phase ambiguous vertices. The technique is not presented here due to space 
constraints, however the number of non-zero vertices examined is reduced by a factor of 4 and 1/3 of the matrix 
inverse calculations in d2TT i are avoided. 

C. PAM Over Complex Channels 

PAM detection over complex fading channels can be viewed as a special case of complex-channel QAM, 
where there is zero imaginary component in the constellation. In this case, the search over the plane YR 2 can 
be restricted by extending the proof of Theorem |2] To do this, we note that the condition in (fT~6b holds, which 
implies that (x opt )'(Re {A opt y} — x opt ) = since x opt is always real-valued. The rest of the proof follows to give 
the result that |Re {A opt ?/ t }| ^ M + T — 2. This fact combined with Property Q] and the 7r phase ambiguity 
of PAM constellations, implies that we only consider codewords x = NN{Y_\) for A in the region S = 
{ A | < Ai < A max = (M + T — 2)/\y m \ }. 

The specifics of the Af-ary PAM algorithm are the same as for the M 2 -ary QAM case, with the exception that 
the calculation of (|2~TT i to obtain the vertices in the interior of the (|2~T1 i is only performed for all i = 1,3, ... , 2T— 1, 
t' = t + 2, t + 4, . . . , 2T, and b, V G B(t), where B{t) = {0, ±2, . . . , ±(M - 2)} if t ^ 2m - 1 and for Bit) = 
{0, ±2, . . . , ±(M - 2), M + T - 2} if t = 2m - 1. 

The total number of codewords searched can be shown to be at most 

N c < Ti - T 2 - [(M - l) 2 - 1] + (T - 1)(M - 1) + 1. (23) 

Since the complexity of each codeword and decision metric calculation is of order T then the overall complexity is 
0(T 3 ). In the following section, we will see that a simple suboptimal approach can achieve even lower complexity 
with near-optimal performance. 
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VI. Suboptimal Algorithms for Complex-Valued Fading Channels 

In this section, we propose even lower complexity suboptimal algorithms for detection of QAM and PAM over 
complex-valued fading channels. We directly use the GLRT-optimal algorithm for real-PAM from Section [IV] as 
the basis for the algorithms. 

A. Suboptimal PAM algorithm 

Since for PAM constellations, all constellation points lie along the real line in the complex plane, a suboptimal 
phase estimation technique combined with our GLRT-optimal algorithm for real-valued fading channels should be 
sufficient to provide near-optimal performance. This effectively reduces the search over the whole plane YR 2 for 
the GLRT-optimal case, to a search over a single line at the given estimated phase angle. 

We use the power-law estimator [27] which, for constellations exhibiting a rotational symmetry of ir radians, is 
simply 

1 T 

-Zj2vt (24) 

L 4=1 

where Z refers to the complex argument. Detection is performed by first rotating the received codeword y according 
to this estimate, and then detecting Re{e _J ^ PL y} using the GLRT-optimal algorithm of PAM over a real-valued 
fading channel. 

B. Suboptimal QAM algorithm 

Here we propose a suboptimal algorithm, which reduces the overall algorithmic complexity to 0(T 2 log T) 
by using 0(T) instances of the PAM detection algorithm presented in Section [IV] Instead of enumerating the 
intersections of lines on the (Ai, A2)-plane, as we did in Section [V-BI here we propose to use a modified version of 
the nearest-neighbor real-PAM line-search algorithm for L lines of the type presented in Section [IV] We generate 
these lines emanating from the origin into S (the shaded region in Figure |2), evenly spaced in angle. Of course, this 
does not guarantee that we fully enumerate Af(C(X),T) since a finite number of radiating lines can not completely 
cover a plane, however, we will see by simulation in Section IV-BI that the performance is close to the optimal. 

As in the optimal case, we multiply y by Vml\Vm\ so that y m will be real-valued and positive. In this suboptimal 
QAM case, this implies that we only examine points on the plane YA for A = [ Ai A2 ]' satisfying < |Ai|, |Aa| < 
A|f \ where A™ x = (M + 2T- 2)/\y m \. 

The L directions of the lines with respect to the direction of positive Ai have angles <1> = {0i, . . . , </>l} where 
(/)£ = (£— l)n/(2L). For each angle <f>g, we perform a nearest neighbor line search for the line with basis vector 
y.[ cos <pi smcf>£ ], as proposed in the suboptimal PAM algorithm in Section [VI- AI The search is performed for 
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the segment of the line Ay^ where AeR and < A < A™ ax where A™ ax = A™ ax / max {cos sin In this case 
the lines searches are performed for blocks of length 2T. 

There is of course a modification required to update the codeword metrics in terms of complex numbers. The 
first line search performed is for </>i = 0, and hence the line search is over Ay = Ay. In this case the intervals of 
the line P(x) are defined as, 



P(x) 4 {a I x e NN(X Zl ), Ae(0,Ar)}. 



Hence the algorithm works by enumerating and calculating the metric for all x G C(X', 2T) for which -P(x) is 
non-empty. 

In this case the set Vo of boundary points of the regions -P(x) is enumerated by calculating v t ,b = b/\y | for 
all t — 1, . . . , 2T and b = 2, 4, . . . , M — 2, (which are the nearest neighbor boundaries in the positive half of the 
constellation X'), and storing only those values of (vt.b, t) such that v t .b < A™ ax . The set of ordered boundary 
points V is again obtained by sorting, and (A™ ax , 0) is appended to V as the extent of the search. Recall that (vk,tk) 
are the fcth elements of V. 

The search through the codewords is initialized to the first codeword for the which the line segment passes through, 
which is given by x^ = s where s = sgn {y}- The likelihood update variables are initialized to a = (x^')^y 
and (3 = Hx^H 2 - To regenerate the optimal codeword, the values of A and 4> are initialized to A = v\/2 and 
</> = </>!= 0. 

The (fc + l)th codeword considered, x( fe+1 ), is calculated from the fcth codeword as 

^ +1) =^+2^. (25) 

To update the decision metric we define = (x^'^y and (3k — ||x^|| 2 , and hence L(x( fe )) = \a^\ 2 / (3^ is the 
decision metric for the fcth codeword considered. The values oik are updated as follows, If tk is odd, then at is 
updated as 

a k -i+2s tk _ 1 y( tk _ 1+1 y 2 , t k -i odd 
oi k - < (26) 

a k -i - 2is tk i y tk _ l/2 , i fe _ieven. 
The values of (3k are updated according to 

0k=0k-i+^s th _ 1 x th _ 1 +4. (27) 

If I/(x( fc )) = \ak\ 2 /(3k improves on the best codeword estimate then we store A = (v k + ^fe-i)/2 and = (pg. 

To start the next line search, y is multiplied by and the line search is then performed again for the new 
value of y. When all line searches have been performed, we calculate x opt = NN^e^y) for the original y. 
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Pseudo-code is provided in Table [V] 

The significantly reduced algorithmic complexity compared to the GLRT-optimal algorithm is governed by the 
number of line searches and the complexity of each line search. Since there are L phases, each performing a 
version of the real-PAM line-search algorithm of Section [TV] for the case M-ary PAM detection of 2T symbols. 
Thus N c ^ L(2T(M /2 - 1) + 1). From Section lV^Bl we have noted that the number of codewords in Af(C(X, T), y) 
is of order M 2 T 2 and thus L must be 0(T) for it to be possible that the majority of Af(C(X, T),y) is enumerated. 
Hence, if L is increased proportionally to T, the overall complexity of the algorithm is 0(T 2 log T). Note that 
however, the improved computational performance of the algorithm is largely due to being able to choose L small, 
which corresponds to avoiding examining a significant number of the x with associated P(x) being so small as to 
imply that x is not relatively close in angle to the plane YR 2 . We will see via simulation in Section [VTIl that small 
L {e.g. L = 4 for T = 7 16-QAM detection) can achieve near-optimal performance. 

VII. Simulation Results 

We now present simulation results to demonstrate the performance of the new PAM and QAM noncoherent 
reduced search lattice-decoding algorithms. Simulations are performed to obtain the codeword error rate (CER) as 
a function of SNR for noncoherent detection of 8-ary PAM and 16-ary square QAM. For both case, the simulations 
are performed for codeword lengths of T — 3 and 7 over a block Rayleigh fading channel where h is i.i.d. circularly 
symmetric complex Gaussian with unit variance. We have assumed that the phase ambiguities have been removed 
within each codeword, (for example, by the use of differential encoding [10]). 

Figure [3] presents results for 8-ary PAM for the GLRT-optimal plane search algorithm from Section [V-CI and the 
suboptimal phase-estimator plus line-search algorithm from Section IVI-AI We also compare with the suboptimal 
grid-search algorithm proposed in [20] and the quantization based receiver proposed in [11]. For the grid-search 
algorithm we use uniformly spaced channel phase estimates and the channel attenuation estimates are chosen 
uniformly from the CDF of the Rayleigh fading channel distribution. For fairness the number of channel attenuation 
estimates is adjusted so that the total number of channel estimates was kept equal to the maximum number of 
codeword estimates that potentially could be produced by our GLRT-optimal algorithm. Best results are obtained 
for choosing the channel phase estimates as and ir/2, and hence the fcth channel amplitude estimate is given 
by |M fc )| 2 = — log(l — fc/(l + \Nc/L])). For the quantization-based receiver (QBR) considered in [11], all 
possible sequences of (positive) amplitude levels are produced, and the sign of each symbol is then determined by 
symbol-wise coherent detection using uniformly spaced channel phase estimates (a channel amplitude estimate is 
not required since the signal amplitude is assumed known). For QBR, we again use the channel phase estimates 
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and 7r/2. 

Figure 2] presents the CER as a function of SNR, for 16-QAM transmission. Results are shown for the GLRT- 
optimal QAM algorithm given in Section [V-B| and the suboptimal algorithm given in Section [VI-BI We also compare 
with the grid-based algorithm, where best performance for a fixed number of codeword estimates was obtained using 
L = 4 channel phase estimates, which we also use for QBR. 

For both the PAM and QAM cases we see that the suboptimal line-search algorithms provide negligible per- 
formance loss compared to the GLRT-optimal algorithm. For the case of T — 3, where QBR is computationally 
possible, there is a noticeable performance loss. As discussed in Section Hl-BI divisor ambiguities result in a lower 
bound on the CER. Expressions for these lower bounds were provided in [25] and are also shown in the figure. 
Clearly, for high SNR, both of our GLRT-optimal algorithms and both suboptimal algorithms detection achieve these 
bounds for both PAM and QAM. As noted in [11], there is an inherent suboptimality introduced by quantizing 
the unbounded channel attenuation by employing a grid-search approach, and hence the performance is clearly 
inferior. Also, although QBR achieves near-optimal performance for T = 3, since the complexity of QBR increases 
exponentially with T is not possible to produce curves for T = 7. 

In Table |T] we present the relative computational complexities of the algorithms for the simulations in terms of the 
average number of codewords examined. The numbers in brackets indicate the number of codewords examined by 
the search if the restrictions on the search region provided by Theorems Q] and [2] are not applied (and are therefore 
slightly greater than the worst case values given in d22i i and (l23ll). We see that the suboptimal phase-estimator plus 
line-search approaches examine far fewer codewords yet obtains near-optimal performance, and that the complexity 
of QBR quickly becomes infeasible with increasing T. 





GLRT-Optimal 
Reduced Search 


Phase Estimator 
+ Line Search 


QBR 


Grid 
Search 


8-PAM T = 3 


132.3 (173) 


7.3 (10) 


128 


174 


8-PAM T = 7 


772.6 (1093) 


16.4 (22) 


32768 


1094 


16-QAM T = 3 


52.6 (87) 


22.9 (28) 


108 


88 


16-QAM T = 7 


311.8 (439) 


52.9 (60) 


8748 


440 



TABLE I 

Number of codewords examined for noncoherent PAM and QAM detection 

VIII. Reduced Ambiguity Transmission 
In this section we extend our new noncoherent detection algorithm to pilot assisted transmission (PAT) systems 
[6]. Unlike, standard PAT we propose to use the pilot symbol for noncoherent ambiguity resolution, rather than 
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simply for channel estimation. We propose to replace the pilot symbol of PAT with a symbol generated in the 
following way. Two bits are allocated for resolving the 7r/2 phase ambiguity of square QAM, and the remaining 
bits in the symbol are allocated to parity, remove divisor ambiguities and improve error performance. Therefore, 
this scheme has the same data rate as PAT and can be compared directly. 

With parity check bits in the codeword, we can now even further reduce the search space of our reduced search 
GLRT lattice-decoding algorithm by only considering codewords which satisfy a parity check. This significantly 
reduces the ambiguity problem. We will denote this parity-aided transmission scheme as reduced ambiguity (RA) 
transmission. 

An arbitrarily chosen parity check scheme might reduce the number of divisor ambiguities, however since the 
metric has a geometric interpretation it may be possible to design other parity-check schemes which both 
resolve ambiguities and optimize performance by providing a minimum angular separation between codewords. 
The resolution of ambiguities can be achieved, at least for 16-QAM, by using the following parity-check scheme. 
Two parity bits pi,p2 are calculated from the data bits {d\, d,2, ■ • ■ , c?2(t-i)} as follows, 

4(T-1) 

t=i 

2(T-1) 

P2 = l+ d ?t (29) 

where = denotes equality in GF(2). They are then mapped to the upper right-hand quadrant of the QAM 
constellation of the first (pilot) symbol in the codeword as follows: (00) >— > 1+ j, (01) >— > 1 + 3j, (11) >— > 3 + 3j 
and (10) >— » 3 + j. Effectively this means the first two bits of the first symbol of each codeword is chosen such 
that x 1: x 2 > 0, which removes the 7r/2 phase ambiguity, and the other two bits are parity bits, which in this case 
can be shown to completely remove the divisor ambiguities (see Appendix iBk 

Figure [5] presents the bit error rate (BER) as a function of SNR for detection of 16-QAM transmitted over a 
block independent phase-noncoherent AWGN channel. Again we have assumed that the phase ambiguities have 
been removed within each codeword. Results are shown for three codeword lengths T = 3,5,7. The figure shows 
curves for our new RA reduced-search GLRT-optimal algorithm, and compares them to standard PAT. Both schemes 
use a single pilot symbol per codeword; which for the RA scheme is generated as described above, and for PAT it 
is a symbol which has energy equal to the average energy per symbol. For PAT, the GLRT estimate of the channel 
(based on the pilot symbol) is used to perform GLRT-optimal data detection, while for RA lattice decoding we use 
our reduced search GLRT-optimal algorithm. Note that for PAT, the BER is independent of the codeword length 
T since it is a symbol-by-symbol detection scheme, whereas for RA lattice decoding the BER decreases as T 
increases since it is a sequence detection scheme. Clearly our scheme outperforms PAT increasingly with T. 
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Figure [6] shows the CER for the scenario of Figure [5] This serves to highlight even further the benefit from 
our lattice (sequence) decoding approach compared with PAT. For PAT, since bit errors occur independently on a 
symbol-by-symbol basis, the CER increases with T. However, for RA lattice decoding the CER decreases. Also the 
figure highlights the advantage of using pilot symbols, compared with fully noncoherent transmission, by observing 
that the SNR range is significantly lower than for Figures [3] and [4] 



In this paper we developed polynomial-time lattice-decoding algorithms for noncoherent block detection of PAM 
and QAM. Faster suboptimal algorithms for QAM were also presented which have excellent agreement with the 
optimal algorithms. A reduced ambiguity transmission scheme was introduced which was shown to outperform pilot 
assisted transmission over the phase noncoherent channel. 



A. Strictly Optimal Calculation of Interior Points 

For each non-empty region P(x), there exists a vertex v € V(x) and small scalars v + , v~ > 0, such that either 
u + = v + [ v + ]' or v = v + [ v~ ]' is in the interior of P(x). 

Suppose the first case is true. Now, the line v +7[ 1 ] 'intersects an edge of the boundary of -P(x), and we will 
call this intersection point [i. We propose to choose v + = 7 > so that v + is the midpoint of v and fi. Defining 
u t as the tth element of u = Y v, where Y is defined by the original received vector y, we can calculate v + as 
follows, 



This process will in general always calculate a point in each non-empty -P(x). However, to avoid calculation 
problems we first rotate v by y m /\y m \, so that the vectors [ v + ]' and [ v~ ]' are not parallel to any of the 
edges of P(x) (e.g. those that are part of S). This rotation is later reversed, so that the points calculated are in the 
original coordinates. 



IX. Conclusion 



Appendix 




Note that almost surely y ^ 0. Similarly, using the line v — 1 ]' we calculate v 



7 > as 
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B. Removal of Ambiguities in 16-ary QAM 

In this section we show that the proposed RA pilot symbol approach (using parity checks, as discussed in Section 
IVIIB totally removes both the phase and divisor ambiguities otherwise inherent in a noncoherent detection system 
(as discussed in Section Hi- At . 

We start by recalling that the proposed parity scheme involves calculating the parity bits from the data bits dt for 
t = 1,...,4T as follows. 

4(T-1) 2(T-1) 

Pi = 1 + Yl * P 2 = l+ J2 d 2t (30) 

1=1 £=1 

where = denotes equality in GF(2). 

The data and parity bits are then mapped to the symbols as shown in Table HH where we recall from the definition 
in (0 that x 2t _ 1 = Re {x t } and x 2t — Im {x t }. 





2* 


P1P2 


Xl 


(pilot symbol) 


00 


-3 


00 


H 


- i 


01 


-1 


01 


1 H 


-3i 


11 


1 


11 


3H 


-3i 


10 


3 


10 


3H 


- i 



TABLE II 



Mapping of data and parity bits. 

Since x\ is constrained to have positive real and imaginary components, the phase ambiguity has been removed. 
It remains to show that all divisor ambiguities have also been removed. 

To do this, we first define the associates of a Gaussian integer g to be the elements of the set A(g) = 
{g, gi, —g, —gi}. We also denote A(g) T to be a codeword of length T composed of only elements of A(g). 
For 16-QAM, it can be easily shown that a necessary condition for a divisor ambiguity to exist is that there exists 
codewords xW 6 A(gi) T , x^ 2 ) £ A(g2) T for some gi, g 2 G X = {1 + i,3 + 3i,3 + i,l + 3i} such that g\ ^ g 2 . 

For a codeword x and some g E X, we define Ni,N 2 , N3 and as the number of occurrences in a codeword 
of each of the four possible rotations of g in the codeword, that is g,gi, —g and — gi respectively. Noting that the 
phase ambiguity has been removed (since x\ is constrained to have positive real and imaginary components), a 
sufficient condition for two codewords to be unambiguous is that there exists some t, such that the <th symbols 
from the two codewords are in different quadrants of the complex plane. It follows then, that a sufficient condition 
for two codewords to be unambiguous is that they do not have the same values of Ni to A4. 

We now use this property on Ai to A4 to show that for arbitrary T, it is not possible for two ambiguous 
codewords x^ 6 A(gi) T , x^ 2 ) € A(g 2 ) T , to satisfy the parity check (f30b for any gi,g 2 G X such that g\ ^ g 2 . 
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We consider each g G X (where we have previously defined X = {1 + i, 3 + 3i, 3 + i, 1 + 3i}) in turn, showing 
that all codewords x G A(g) T that satisfy the parity check, are distinguishable in phase from all parity-satisfying 
codewords considered up to that point. For 16-QAM this process involves considering the four Gaussian integers 
1 + i, 3 + 3i, 3 + i and 1 + 3i in turn, as detailed in the following four cases. 
Define x D to be the data codeword component of x, i.e. x = [0:2 • • • xt]' ■ 
. Case x D g A(l + i) T - x : 

In this case, we show that there does not exist any x G A(l + i) T that satisfies the parity check. Using 
Table HIl the bits (du-z ■ ■ ■ du) are mapped to the symbol xi = x 2e _ 1 + ix 2e G x D in the following way: 
(1111) >-»• 1 + i, (0111) >-> -1 + i, (0101) -1 - i and (1101) >-► 1 - i. Clearly from <|29), p 2 = 1, and 
therefore the pilot symbol x% will be either 1 + 3i or 3 + 3i. It follows that x ^ A(l + i) T . 
. Case x D G A(3 + 3i) T - 1 : 

In this case, we show the conditions under which a codeword x G A(3 + 3i) T satisfies the parity check. The 
associated bit mappings are (1010) >-► 3 + 3i, (0010) ^ -3 + 3i, (0000) >-► -3 - 3i and (1000) 3 - 3i. 
Clearly, p± = 1 + N2 + N4 and p 2 = 1. Therefore, 



Furthermore it follows that x G A(3 + 3i) T only if N 2 = N 4 . 
. Case x D G A{3 + i) T ~ 1 : 

In this case, we show the conditions under which a codeword x G A(3 + i) T satisfies the parity check, and 
show that under these conditions there does not exist any ambiguous codeword from A(3 + 3i) T , i.e. from the 
previous case. The bit mappings are (1011) >-> 3+i, (0110) >-► -l+3i, (0001) >-» -3-i and (1100) >-» 1-31 
In this case, pi = 1 + Ni + N 3 and p 2 = 1 + Ni + N 2 + N 3 + N 4 = 1 + T - 1 = T. If T is odd, then 
p 2 = 1 and therefore x\ G {1 + 3i, 3 + 3i} and therefore x ^ A(3 + i) T . If T is even then p 2 = and 
pi = 1 + iVi + 7V 3 = iV 2 + Ni. Therefore 



It follows that x G A(3 + i) T only if N 2 ^ N 4 and T is even. Recall that in the previous case, valid 
parity satisifying codewords only occurred if N 2 = N4. Therefore an ambiguity will not occur between two 
codewords x G A(3 + i) T and x^ 1 ) G ^4(3 + 3i) T since they will be distinguishable in phase. 
. Case x D G A{l + 3i) T - 1 : 

In this case, we show the conditions under which a codeword x G A(\+3i) T satisfies the parity check, and show 




(01) i.e. if N 2 ^ 7V 4 , 



(11) i.e. if N 2 = N4. 




1 + if { PlP2 ) = (00) i.e. if N 2 = N4. 



3 + i if (pm) = (10) i.e. if N 2 £ N 4 , 



IEEE TRANSACTIONS ON SIGNAL PROCESSING, ACCEPTED TO APPEAR (ACCEPTED NOV. 2006) 



23 



that under these conditions there does not exist any ambiguous codeword from either A(3 + 3i) T or A(3 + i) T , 
i.e. from the previous two cases. The bit mappings are (1110) >— > 1 + 3i, (0011) >— ► —3 + i, (0100) >— > — 1 — 3i 
and (1001) >-» 3 - i. Here, p x = 1 + TVi + N 3 and p 2 = T. If T is even, then p 2 = and therefore 
xi e {1 + i, 3 + i} and no ambiguity occurs. If T is odd then pi = 1 + N 2 + N4 and p 2 = 1- Therefore, 



It follows that x e A(l + 3i) T only if iV 2 ^ N4 and T is odd. Clearly, these conditions are different to those 
to the previous two cases and therefore no ambiguities exist. 
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1 


begin 




2 


s : 


= sgn y; 


// Store sign of each y t 


3 


y 


= s o y; 


II Make each y t positive 


4 


B n» ._ M + j, _ 2 . 




5 


m 


:= argmaxt {yt}; 




6 


A° 


" ■- (M + 2T -2)/\y„ 


,|; // Search region: < A < A m ™ 


7 


Vo 


:= 0; // Calculate and store P(x) boundary points 


8 


for t := 1 to T do 




9 


tor all 6 e {2,4, . . . ,M- 


2} do 


10 




v := b/y t ; 




11 




if v < A m "; 




12 




Vo := {V,(M)}; 




13 




else break; 




14 


end for all; 




15 


end for; 




16 


V 


= sort(Vo); 


// Sort Vo in ascending order of v 


17 


V 


= {V,(A m »,0)}; 




18 


X 


= [11 ... l]'; 


II Initialize data estimate 


19 


a 


= x'y; 


// Initialize likelihood terms 


20 





= llill 2 ; 




21 


L 


= « 2 //3; 




22 


A 


= V(l,l)/2; 




23 


for k := 1 to |V| - 1 do 


// Iteratively examine likelihoods 


24 


t 


:= V(fc, 2); 




25 


a := a + 2y t ; 


// Update likelihood terms 


26 


(3 := P + Ax t +A; 




27 


&t ■= x t + 2; 


// Update x 


28 


if a 2 //3 > L 


// If better x found 


29 




L := a 2 /ft 


// Update likelihood 


30 




A := (V(fe, 1) + V(fe + 1, l))/2; // Store point in P(x) 


31 


end if; 




32 


end for; 




33 


return x u P' := so NN(\y); 





TABLE III 



M-ARY REAL-PAM NONCOHERENT LATTICE DECODING ALGORITHM 
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1 


begin 


2 


m := argmax t { \y t }; 


3 


y :— / \y-m\)y> ^ Rotate y so that y m is purely real 


4 


B tmx ._ A/ _|_ 2 T - 2; 


5 


A mdx :— £? mdx / y m |; II Search boundary (Thm. l2l. 


6 


x bL ' s ' := NN((e + ie)y); // Codeword near origin 


7 


L m " := L(x bos '); // L'hood L(x) = x 1 y 2 / llxll 2 

V / ' \ / •//MM 


§ 


U . — 1") ^5 ■ ■ ■ i — J' rUSlllVC I'l.l'l LKJUllLlal ICS 




/ / Calculate only intel section points (i.e. vertices) in 




// first cju3.dr3.nt using ^^^^ 1 educing number of 




// NN boundaries Z3i, t3'2 and then rotating. 


9 


e nr f . — 1 * ft 97-1 _ 1 j n 

Mil L . ± 119 _ J. J. UU 


10 


B\ ■— B~ 


11 


if r c SOt-r, 1 %n\ than R, /R, R max \. 

11 I, t \&ltl — lllcll Lj\ . — 1 Ul , JJ f, 


12 


f rtr f _|_ 1 t,\ OT Hn 

1UI t . — L -J- J. IO _A HO 




K?„ K7. 

02 ■ — 


14 


11 t t \zm — 1, zm} men 02 : — 1^2, /■> 


IS 


S := — *' 1 — *' 2 ; // Matrix in {TQ 

L— £4', 2 J 


16 


for all eBi 


17 


for all 6 2 £B 2 




// Calculate intersection point; 


18 


v := Real-To-Complex (S[ bi 6 2 


19 


^ := Rotate-To-First-Quadrant(i/); 


20 


for all s e {-1, 1} 


21 


A ;= w + s(e + i.e); // Point in some partition 




// Check that A is in reduced search region 


22 


if < Re {A} < A max and 




^ Im{A} < A 108 * then 


23 


x := NN(Xy); 11 Calculate NN 


24 


if L(x) > L max // If better x found 


25 


x best :— x; // Update codeword estimate 


26 


j-msx ._ x,(x); // Update likelihood 


27 


end if; 


28 


end if; 


29 


end for all; 


30 


end for all; 


31 


end for all; 


32 


end for; 


33 


end for; 


34 


return x opl := x best ; 



TABLE IV 

M 2 -ARY SQUARE QAM NONCOHERENT LATTICE DECODING ALGORITHM 
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1 


begin 




2 


L r= 0; 


ii Initialize likelihood 


3 


A™ 2 " — (M + 2T - 2)/max t \y t \; 


4 


for I - 1 to L then 




5 


// Search region: < A < A max (Theorem[2) 


6 


A milx :— A u lax / mill || cos 


fel. I sin fr|}; 


7 


Vo — 0; // Calculate and store -P(x) boundary points 


8 


for t = 1 to 2T then 




9 


for all 6 £ {2, 4, . . . , M 


- 2} then 


10 


v-b/\y t \; 




11 


if v < A"™; 




12 


Vq := {V ,(M)}; 




13 


else break; 




14 


end for all; 




IS 


end for; 




16 


V :^ort(V ); 


// Sort Vo in ascending order of u 


17 


V := {V, (A™\0)}; 




18 


s := sgn{y}; 




19 


x :— s; 


II Initialize data estimate 


20 


a :— x^y; 


// Initialize likelihood terms 


21 


■■= ll^ll 2 ; 




22 


it a 2 / 13 > L 


// If better x found 


23 


L:=a 2 /f3; 


// Update likelihood 


24 


A := V(l)/2; 




25 


:= lir/(2L); 




26 


end if; 




27 


for k := 1 to |V| - 1 do 


// Iteratively examine likelihoods 


28 


t := V(fe,2); 




29 


if t' is odd then 




30 


a := a + 2s t j/ (t+ i )/2 




31 


else 




32 


a := a - 2is_ t y t / 2 ; 




33 


end if 




34 


:= p + 4s t x t + 4; 




35 


x_ t '■— x t -\- 2s £ ; 




36 


if |a| 2 //^> L 


// If better x found 


37 


L := \ a \ 2 /{3; 


// Update likelihood 


38 


A :^ (V(fc, 1) + V(fe + 1, l))/2; // Store point in P(x) 


39 


(/> :^ £tt/(2L); 




40 


end if; 




41 


end for; 




42 


y := ye2i ; 


// Rotate y for next line search 


43 


end for; 




44 


return x°P l := NN(Xe J<p y); 





TABLE V 

SUBOPTIMAL M 2 -ARY SQUARE QAM MULTIPLE LINE-SEARCH NONCOHERENT DETECTION ALGORITHM 
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10 - 



X 




x l 



Fig. 1, Illustration of noncoherent detection of 8-ary PAM for T = 2. The dots are all the (two dimensional) PAM codewords in the positive 
quarter-plane, and the angled line is yM, for an example received codeword y. The shaded regions indicate the nearest neighbor regions 
of points which need to be searched. That is, they are in Af(C(X, T), y) (from Property [T}, and they correspond to values of A less than 
A max = ( A/ + T _ 2)/ maxt \ yt \ = Ml max t \y t \ (from Theorem [TJ. 




Fig. 2. Plot of partitions P(x) on the R 2 plane for 16-ary QAM detection of a sequence of length T = 3 for the received vector y = 
[ -0.1076 - 0.4728i, -0.7002 - 0.0968i, -1.1228 + 0.4955i ]. The bold square corresponds to the search boundary S. 
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New GLRT-Optimal Reduced-Search 
Phase-Estimator + Line Search 
Grid Search [20] 
QBR (exponential complexity) 
Ambiguity lower bound 




Fig. 3. Plot of Codeword Error Rate (CER) as a function of SNR for an 8-ary PAM system. 
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■ Phase-Estimator + Line Search (Section VLB) 
Grid Search [20] 
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Fig. 4. Plot of Codeword Error Rate (CER) as a function of SNR for a 16-ary square QAM system. 
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SNR (dB) 

Fig. 5. Comparison of Bit Error Rate (BER) as a function of SNR for 16-QAM for PAT versus RA transmission. 




SNR (dB) 

Fig. 6. Comparison of Codeword Error Rate (CER) as a function of SNR for 16-QAM for PAT versus RA transmission. 



